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(57) ABSTRACT 

A method for detecting and discarding stale cells following 
route changes in a data communication network. The data 
communication network comprises a transmitter, which 
upon detection of a failure in a route of a network, retrans- 
mits data tagged as resent data along a different route of a 
multi-stage switch; and a receiver, which upon detection of 
tagged data from the transmitter via the different route of the 
multi-stage switch, utilizes tagged data for data communi- 
cations while discarding previously transmitted data that are 
not tagged to avoid data duplication. 

18 Claims, 4 Drawing Sheets 
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METHOD FOR DETECTION OF STALE 
CELLS FOLLOWING ROUTE CHANGES IN 
A DATA COMMUNICATION 

5 

TECHNICAL HELD 

The present invention relates to a data communication 
network, and more particularly, relates to a method for 
detection of stale cells following route changes in a data 
communication network. 

BACKGROUND 

A data communication network is generally consisted of 
a network of nodes connected by point-to-point links. Each 
link may be a bi-directional communication path between 
two cxjnnect nodes (stations) within the network. Data may 
be transmitted in groups called cells or packets from a source 
to a destination, often through intermediate nodes. In many 
data communication networks, cells (or data packets) 
between two endpoints (e.g., end stations such as computers, 
servers and/or I/O devices) may transverse the network 
along a given path to ensure that cells are delivered in the 
order transmitted and without duplication. A control mecha- 
nism may be used to permit the re-transmissioo of a cell or 
group of cells if the original cell is corrupted in the network 
due to network errors, including, for example, random noise. 

If a particular link between two connect nodes fails within 
the infrastructure of the network, retransmission cells may 
be re-routed by the control mechanism via an alternate link 3Q 
that does not include the failed link (assuming such a link 
exists). After the failure is detected, a new route chosen by 
the control mechanism, duplicate old cells may still exist in 
the network that have not yet transversed the old route. 
These cells may be referred to as stale cells. These stale cells 35 
may interact with cell retransmission which can result in 
delivery of duplicate cells. The duplicate cells must be 
ehminated in order to prevent corruption of data communi- 
cation between two endpoints in the network. 

Currently, there may be two techniques commonly 40 
employed to address the duplicate or stale cell problem. The 
first technique seeks to include a sequence number in each 
cell at the transmitting endpoint. This sequence number must 
be sufBciently large to ensure that a unique number may be 
assigned to every cell that may exist in the network infra- 45 
structure at any given time. Using the sequence number, the 
receiving endpoint may discard any cell that contains a 
sequence number that has been previously -received. The 
major disadvantage of this technique is that, in large data 
communication networks, the overall performance of the 50 
network can be significantly degraded, e.g., significant com- 
puting time can be spent in bookkeeping choirs. Moreover, 
in many data communication networks that support multiple 
and/or simultaneous communication flows between two 
endpoints, the large sequence number can significantly affect 55 
the cost of implementing the context store required for each 
flow. 

The second technique seeks to ensure a quiet period just 
prior to, and for a time period after, the network is config- 
ured with the new route. This quiet period must be suffi- 60 
ciently long to ensure that all cells that were in the network 
infrastructure at the time of the re-route have exited the 
network. During this quiet period, endpoints affected by the 
re-route may be prohibited from injecting new cells into the 
network. There may be two significant limitations to this 65 
technique, however. First, the endpoints of the network must 
be coordinated to prevent injection of new cells into the 
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network. This required participation of the endpoints may 
increase the complexity of the switching algorithm. Second, 
the required quiet time may be unacceptably long, particu- 
larly in large data communication networks which operate at 
significant network speeds. 

Therefore, there is a need for a more flexible, cost- 
effective and performance-efficient technique for recovery 
from faults within a network infrastructure which requires 
neither a quiet period, nor that the transmitting endpoint be 
aware that the route through the network infrastructure has 
changed for data communications. 

SUMMARY 

Accordingly, various embodiments of the present inven- 
tion are directed to a data communication network and a 
method for detection of stale cells following route changes 
in a data communication network. The data conununication 
network comprises a transmitter, which upon detection of a 
failure in a route of a network, retransmits data tagged as 
resent data along a different route; and a receiver, which 
upon detection of tagged data, utilizes tagged data for data 
communications while discarding previously transmitted 
data that arc not tagged to avoid data duphcation, 

BRIEF DESCRIPTION OF THE DRAWINGS 

A more complete appreciation of exemplary embodiments 
of the present invention, and many of the attendant advan- 
tages of the present invention, will become readily apparent 
as the same becomes better understood by reference to the 
following detailed description when considered in conjunc- 
tion with the accompanying drawings in which like refer- 
ence symbols indicate the same or similar components, 
wherein: 

FIGS. lA and IB illustrate an example of a less advan- 
tageous data communication network having several nodes 
interconnected by corresponding links; 

FIG. 2 illustrates an example data in groups of cells for 
communications according to the principles of the present 
invention; 

FIG. 3 illustrates an example redundant data communi- 
cation network having alternate switches for data commu- 
nications between several interconnected nodes via corre- 
sponding links and alternate links according to an 
embodiment of the present invention; 

FIG. 4 illustrates example cascaded switches of the redun- 
dant data communication network as shown in FIG. 3; 

FIG, 5 illustrates an example implementation of a redun- 
dant data communication network using an example input/ 
output (I/O) channel architecture according to an embodi- 
ment of the present invention; and 

FIG. 6 is a block diagram of an example individual switch 
of a multistage switch of an example redundant data com- 
munication network according to an embodiment of the 
present invention. 

DETAILED DESCRIPTION 

The present invention is applicable for use with all types 
of data communication networks designed to link together 
end stations such as computers, servers, peripherals, storage 
devices, and communication devices for data commimica- 
tions. Examples of such data communication networks may 
include a local area network (LAN), a wide area network 
(WAN), a campus area network (CMl), a metropolitan area 
network (MAN), a global area network (GAN) and a system 
area network (SAN), including newly developed data net- 
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works using Next Generation I/O (NGIO) and Future I/O verse. In particular, each cell may correspond to 256 bytes, 

(FIO), now known as InfiniBand™ as set forth in the for example, and contain a header (HDR) including a ceU tag 

"InfiniBand'™ Architecture Specification," the InfiniBand™ which will be described in the following discussion. 

Trade Association on Oct. 24, 2000. and Server Net and Discussion now turns to FIG. 3, which illustrates an 

those networks which may become available as computer 5 example redundant type network having redundancy paths 

technology develops in the future. LAN sy^em may include or alternate links (routes) used in the event of a break of 

Ethernet, FDDI (Fiber Distributed Data Interface) Token some links or components in the network. As shown in FIG. 

Ring LAN, Asynchronous Transfer Mode (ATM) LAN, 3, the example redundant type network 10' may also include 

Fiber Channel, and Wireless LAN. However, for the sake of a second switch 200' (SW2) in addition to the first switch 

simplicity, discussions will concentrate mainly on exem- jq 100' (SWl) and the four dififerent nodes A, B, C, and D 

plary use of a simple data network having several example shown in FIG. 1. The second switch 200' (SW2) may be used 

nodes interconnected by corresponding links, although the to provide a set of redundant paths from source nodes to 

scope of the present invention is not limited thereto. destination nodes, via one or more intermediate nodes, in 

Attention now is directed to the drawings and particularly order to ensure that data are delivered to respective desti- 

to FIGS. LA and IB, an example data communication 15 nation nodes in the event of a break of some links or 

network having several interconnected nodes for data com- components in the network. Accordingly, if a break occurs 

munications is illustrated. As shown in FIG. LA, the data along any one of the link UO between node A and switch 

communication network 10 may include, for example, a 100' (SW 1) as indicated by break #1, or switch SI as 

centralized switch 100 and four different nodes A, B, C, and indicated by break #2, or the link 114 between switch SWl 

D. Each node may correspond to an end station including, 20 ^^^^ ^ indicated by break #3, the data transmitted 

for example, a computer, a server and an input/output (I/O) from a source node to a destination node can be re-routed 

device. The centralized switch 100 may contain switch ports along the redundant path using the second switch 200' 

0, 1, 2, and 3 each connected to a corresponding node of the (SW2). 

four different nodes A, B, C, and D via a corresponding The problem sought to be solved by the present invention 

physical link 110, 112,114, and 116. The centralized switch 25 may not necessarily be a permanent breakage within the 

100 may also include a switch table (FIG. IB) containing redundant network, but instead, a temporary blockage or 

re-routing information in addition to routing information breakage within such network. More particularly, for 

using, for example, explicit routing and/or destination example, assuming that data are to be transmitted from 

address routing. source node A to destination node D, source node A may 

As shown in FIG. IB, the switch table may contain listing 30 contain a predetermined hierarchy or list therein for use to 

of destination node addresses and corresponding destination send data along a preferred path (link). For example, node A 

ports, for example, node D and a corresponding switch port may contain a predetermined listing indicating that data 

3 (i.e., received cells destined for node D are output from should first be transmitted through switch SWl to node D 

switch port 3) and node C and a corresponding switch port and then, if problems occur, the alternate link or path 

2. Since there is no redundancy or redundant link available 35 through switch 200' (SW2) should then be attempted, 

for data communications, a problem such as a temporary or Accordingly, when node A of the redundant type network as 

permanent interruption may occur if a break occurs any- shown in FIG. 3 first begins to transmit data to node D 

where within the data communication network of FIGS. lA through switch SWl, and assuming that a blockage or 

and IB. Such a break may occur, for example, along a node temporary breakage occurs at any one of the locations break 

port of node A or along a corresponding link 110 as 40 #1» break #2 and/or break #3, the redundant type network of 

illustrated by break #1. Similarly, a break may occur at the FIG. 3 may contain some types of arrangement for detecting 

centralized switch 100 as illustrated by break #2. Likewise, a problem within the redundant network, 

a break may occur along a node port of node D or along a For example, when node A sends data to node D, node A 

corresponding link 114 as illustrated by break #3. As shown expects an acknowledgment of receipt of such data from 

in FIGS. lA and IB, when any of these example breaks #1, 45 node D within a predetermined time period. If such an 

#2 and/or #3 occurs anywhere within the data communica- acknowledgment does not occur within a predetermined 

tion network there is no way of transmitting data in groups time period, node A assumes that cells (or data packets) 

of cells to an affected node. which were not acknowledged were never received by node 

Turning now to FIGS. 2-6, example redundant types of D, and further assumes that there is a problem occurring 

data communication networks having redundant links or 50 along the previous route selected to transmit the data, 

alternate routes according to an embodiment of the present Accordingly, node A switches to an alternate predetermined 

invention are illustrated. These redundant types of data route for re-routing the cells around the failed link(s) in 

communication networks may be used to ensure that data in order to ensure that data are successfully delivered to 

groups of cells (or data packets as per InfiniBand™ Archi- respective destination nodes in the network. In the present 

tecture Specification) are successfully transmitted to respec- 55 situation, node A attempts to retransmit the non- 

tive destination nodes without cell duplication or stale cells acknowledged cells through the switch 200' (SW2) to node 

in the event of a break of some links or components in the D. However, a problem occurs if node A has re-transmitted 

network. Before beginning to discuss the operation of the a series of cells and then the temporary blockage or breakage 

redundant type of data communication network, discussion becomes free and then node D begins to receive redundant 

first turns to FIG. 2 which describes the difference between 60 or duplicate cells along or from both of the switches 100' and 

a packet and a cell of data transmitted from a source node to 200' (SWl and SW2). These redundant or duplicate ceUs are 

a destination node through switches and/or other interme- known as stale cells and need to be discarded from the 

diate nodes. As shown in FIG. 2, a packet may represent a network for reliable and efiScient cell relay operations, 

large data entity of user data and may be implemented as The present invention seeks to simply and efiBciently 

four (4) gigabytes minus one (1) in addition to cell header 65 detect and discard these redundant or duplicate stale cells 

information and cyclic redundancy check (CRC) informa- from the network for reliable cell relay operations by using 

tion. In contrast, a cell may be a subset of a packet or via a "cell tagging" technique for each communication flow 
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between a source node and destination node individually. acknowledged from a destination node or node D, and then 

More particularly, the header HDR of the cell (or individual re-route each tagged cell to the destination node or node D 

data packet) which contains data pertaining to cell tagging as through a redundant or alternate route (link) which is 

shown in FIG. 2 may be used by each switch or node within different from the previously detected problematic route 

the redundant type network to determine which cells are 5 based on its predetermined listing. In the example as 

tagged and not tagged for discarding purposes. Accordingly, described with reference to FIGS. 3 and 4, node A may 

upon occurrence of detection of a route problem, a source retransmit all cells which were not acknowledged by desti- 

node or node A as shown in FIGS. 3 and 4 may automatically nation node D by way of transmitting the tagged cells to 

enter a cell tagging mode as part of an internal re-route node D through second switch 200' (SW2). Upon expiration 

algorithm, and remain in such a cell tagging mode for cell °^ tagging interval, the cell tagging mode of a source 

tagging during a predetermined cell tagging interval. A node or node A may be terminated automatically based on a 

destination node or node D may then receive tagged cells buih-in internal timer (not shown), 

and, upon receipt of a first tagged cell, automatically enter When a first tagged cell transmitted from node A arrives 

a reject untagged cell mode as part of an internal re-route at node D, node D may read the cell tagging information 

algorithm, and remain in such a reject untagged cell mode from the header of the tagged cell and immediately enter a 

for one half (^) the cell tagging interval (for example) so as reject untagged cell mode for one half (Yt) the cell tagging 

to discard all received cells that are not tagged (stale cells). interval, that is, for 30 microseconds. Thereafter, destination 

The period of the cell tagging interval may be set relative to node or node D may assume for one half (Vi) cell tagging 

the maximum delay time for a cell to exist in all routes interval that only the received cells having the activated cell 

within a switch fabric of the redundant type network before 20 ^g©"g ^^e valid cells, and then promptly discard any 

being either delivered to a destination node or discarded redundant cells which may have received from the source 

from the network. This is a period known as a fabric lifetime node or node A through first switch 100' (SWl). In other 

within a switch fabric of the redundant type network that is words, the destination node or node D may ignore all 

determined by each switch within the switch fabric to insure untagged cells which may have received from the source 

that all stale cells are exited from the switch fabric. For 25 node or node A at input ports. 

purposes of this disclosure, the period of the cell tagging Upon expiration of one-half (^) cell tagging interval, the 

interval may be set as twice the fabric lifetime (for example) reject untagged cell mode of a destination node or node D 

to ensure that there may be no overlap between the cell may be terminated automatically based on a built-in internal 

tagging mode of the source node and the reject untagged cell timer (not shown). Each destination node such as node D 

mode of the destination node. However, it should be noted 3Q described with reference to FIGS. 3 and 4 may further 

that the present invention is not limited to such intervals. The include a mechanism which provides a source node or node 

cell tagging interval may be set as any discrete value greater A an indication that previously clogged or redundant cells 

than the fabric lifetime, as long as the destination node D, as are now being received again, and that the temporarily 

shown in FIG. 3, for example, may be assured to be removed blocked/broken network link is now fixed. That way the 

from the reject untagged cell mode before the source node 35 source node or node A may stop cell tagging and return to 

A, as shown in FIG. 3, for example; stops tagging incoming using the previously broken, but now fixed network link for 

cells in the next cell tagging mode to avoid mode overlap- data communications. As a result, the redundant cells can be 

ping. A link failure or route problem within the network may simply and effectively discarded from the network and the 

be determined, for example, by a control mechanism of a use of redundant cells can be completely avoided, thus 

source node A utilizing a time-out of a response or by a guaranteeing an integrity of the network, 

central network manager managing general faults or link piG. 5 illustrates an example implementation of a redun- 

failures of the network. dant data communication network having an example input/ 

More particularly, FIG. 3 is a very simplistic illustration output (I/O) channel architecture for data communications 

of a redundant type network for the sake of clarity and according an embodiment of the present invention. As 

brevity. Instead of just two switches 100' and 200' (SWl and 45 shown in FIG. 5, the redundant data communication network 

SW2), in fact, the network may include many switches. For includes a multi-stage switch 10' comprised of a plurality of 

example, as indicated by the dashes within the redundant switches, including, for example, first and second switches 

type network, the path from source node A to destination 100' and 200' (SWl and SW2) of FIG. 3 for allowing host 

node D through the switch S2 may actually include addi- systems and target systems to communicate to a large 

tional switches SW3 and SW4. 50 number of other host systems and target systems. In 

Turning now to FIG. 4, and assuming that each of the addition, any number of end stations, switches and links may 

switches SW3, SW2 and SW4 which are cascaded as shown be used for relaying data in groups of ceUs between the end 

incurs a delay of, for example, 10 microseconds, a total stations and switches via corresponding links. For example, 

maximum delay through the switching fabric would be 30 node A may represent a host system 320 as shown in FIG. 

microseconds to transmit data from source node A to des- ss 5. Similarly, node B may represent another network, 

tination node D. The present invention then sets the cell including, but not limited to, local area network (LAN), 

tagging interval to approximately twice the maximum delay Ethernet, ATM and fibre channel network. Node C may 

(fabric lifetime) or two limes 30 microseconds equals to 60 represent an input/output (I/O) device 360. Likewise, node 

microseconds of a cell tagging interval which is effected at D may represent a remote system 380 such as a computer or 

node A. In contrast, the receiving node or node D only enters 60 a server. Alternatively, nodes A, B, C, and D of FIG. 3 may 

the cell tagging interval (upon receiving any first tagged also represent individual switches of the multi-stage switch 

cell) for only during the fabric lifetime, that is, one -half (Vi) 10* which serve as intermediate nodes between the host 

of the maximum cell tagging interval or 60 microseconds system 320 and the target systems 340, 360 and 380. 

divided by two equals the 30 microseconds. The multi-state switch 10' may include a central network 

When node A is in a cell tagging mode, node A may insert 65 manager 150 connected to all the switches for managing all 
information data (e.g., a special code) pertaining to cell network management functions. However, the central net- 
tagging into the header HDR of each cell which was not work manager 150 may alternatively be incorporated as part 
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of either the host system 322, the second network 340, the database 202. The switch manager 204 may also manage the 

I/O device 360, or the remote system 380 for managing all forwarding or relaying of cells between links using either 

network management functions. In either situation, the ccn- destination address routing, explicit routing or a combina- 

tral network manager 150 may be configured for learning tion of the two. In addition, when serving as an intermediate 

network topology, determining the switch table or forward- 5 node between a source node and a destination node, the 

ing database, detecting and managing faults or link failures switch manager 204 may also control the re-route or cell 

in the network and performing other network management tagging operation in the event of a break of some links or 

functions. When there may be faults or link failures in the components in the network in order to insure that all stale 

network, the central network manager 150 may alert the cells are discarded from the switch fabric in the manner 

source node or node A to automatically enter a cell tagging described with reference to FIGS. 3-4. 

mode for cell tagging purposes, that is, for re-routing or For example, if a source node and a destination node of 

re-transmitting the same cells to a destination node or node the example redundant data communication network repre- 

D around the failed links where alternate links exist in the sent a host system 320 and a remote system 380 respectively 

network while discarding redundant or stale cells from the as shown in FIG. 5, then the host system 320 may be set to 

switch. automatically enter a cell tagging mode in response to 

Separately, the central network manager 150 may also be detection of a link failure or route problem within the 

used to determine the maximum delay time (fabric lifetime) network, and the remote system 380 may be set to auto- 

that a cell may exist in the switch fabric before being either matically enter a reject untagged cell mode in response to 

delivered to a destination node or discarded from the switch receipt of a first tagged cell from the host system 320. A link 

fabric. As described with reference to FIGS. 3 and 4, the 20 failure or route problem within the network may be 

fabric lifetime may be utilized to insure that all stale cells are determined, for example, by either a time-out mechanism of 

exited from the switch fabric. The fabric lifetime may be the host system 320 or by the central network manager 150 

determined by first calculating a switch cell lifetime, that is, of the multi-stage switch 10'. If the central network manager 

the time a cell may exist in a given switch, then calculating 150 is configured to inform the source node or host system 

the maximum time a cell may exist in all routes within the 25 320 of the link failure or route problem of the network, then 

switch fabric. the time-out mechanism may not be needed for the source 

A host channel adapter (HCA) 322 may be used to provide node or host system 320 to detect such a link failure. Other 

an interface between a memory controller (not shown) of the link failure detection mechanisms may also be used instead 

host system 320 and a multi-stage switch 10' via high speed of the time-out mechanism of the host system 320 or the 

serial links. Similarly, target channel adapters (TCA) 342, 30 central network manager 150 of the multi-stage switch 10' as 

362 may be used to provide an interface between the long as link failures of the network are detected for cell 

multi-stage switch 10' to an 1/0 controller of either an I/O tagging purposes. 

device 360 or a remote system 380 via high speed serial The host system 320 may contain a built-in timer (not 

links. Separately, another host channel adapter (HCA) 382 shown) set for cell tagging during a cell tagging interval, 

may also used to provide an interface between a memory 35 Upon expiration of the cell tagging interval, the host system 

controller (not shown) of the remote system 380 and the 320 may be automatically reverted from the cell tagging 

multi-stage switch 10' via high speed serial links. mode back to normal operation. Likewise, the remote sys- 

FIG, 6 is a block diagram of an example individual switch tem 380 may contain a built-in timer (not shown) set for 

of a multi-stage switch 10' of an example redundant data rejecting untagged cells (i.e., stale cells) during a reject 

communication network according to an embodiment of the 40 untagged cell mode having a period one half (V^) the cell 

present invention. Each switch 100' or 200' of the multi- tagging interval. Again, the cell tagging interval may be set 

stage switch 10' may include a switch table or forwarding as twice the fabric lifetime to ensure that there may be no 

database 202, a switch manager 204, a memory 206, a relay overlap between the cell tagging mode of the source node 

function imit 208, a plurality of switch ports, including ports and the reject untagged cell mode of the destination node. 

1-n (shown as ports 220, 222 and 224), and a plurality of 45 However, any discrete value greater than the fabric hfetime 

receive (Rx) and transmit (Tx) queues 230-240. A switch may be suitable as long as the destination node may be 

manager 204 may be a logic or processor used for managing assured to be removed from the reject untagged cell mode 

all functions of a switch. For example, the switch manager before the source node stops tagging incoming cells in the 

204 may relay cells between the host system 320 and the next cell tagging mode to avoid mode overlapping for each 

target systems 340, 360, 380 and, if serving as an interme- 50 communication flow between the source node and the des- 

diate node within the network, may re-route or re-transmit tination node of the network infrastructure, 

the same cells to target systems 340, 360, 380 around the The remote system 380 may also contain a mechanism 

failed links in the event of a break in the links or components which informs the host system 320 that the temporary link 

of the network while discarding redundant or stale cells from failure in the network is now fixed. That way the host system 

the switch fabric. A memory 206 may optionally contain a 55 380 may stop cell tagging and return to using the previously 

re-route or cell tagging information for re-routing cells broken, but now fixed network link for data communica- 

around the failed links where alternate links exist in the tions. As a resuU, the redundant cells can be simply and 

network when serving as an intermediate node between a effectively discarded from the network and the use of 

source node and a destination node. A relay function unit redundant cells can be completely avoided, thus guarantee- 

208 may be provided for relaying received cells to specific 60 ing an integrity of the network. 

destination ports. The receive (Rx) and transmit (Tx) queues As described from the foregoing, the present invention 

230-240 may be connected between the relay function unit advantageously provides a unique cost-effective and 

268 and each switch port 220, 222 and 224. performance-eflBcient solution for recovery from faults or 

The switch manager 204 may communicate with the link failures within a network infrastructure which requires 

central network manager 150 as shown in FIG. 5 to receive 65 neither a quiet period, nor that the transmitting endpoint be 

information for switch initialization (e.g., to receive the aware that the route through the network infrastructure has 

switch address) and to receive and download the forwarding changed for data communications. Switch elements of a 
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multi-stage switch permit routing and alternate routing of 
data in groups of cells between endpoints efficiently on an 
individual communication flow basis. 

While there have been illustrated and described what are 
considered to be exemplary embodiments of the present 5 
invention, it will be understood by those skilled in the art and 
as technology develops that various changes and modifica- 
tions may be made, and equivalents may be substituted for 
elements thereof without departing from the true scope of 
the present invention. For example, the present invention is lO 
applicable to all types of redundant type networks, 
including, but not limited to, Next Generation Input/Output 
(NGIO), InfiniBand™ ATM, SAN (system area network, or 
storage area network), server net, Future Input/Output 
(FIO), fiber channel, Ethernet). Many modifications may be 15 
made to adapt the teachings of the present invention to a 
particular situation without departing from the scope 
thereof. Therefore, it is intended that the present invention 
not be limited to the various exemplary embodiments 
disclosed, but that the present invention includes all embodi- 20 
ments faUing within the scope of the appended claims. 

What is claimed is: 

1. A network, comprising: 

a transmitter to transmit data along a route of a network 
and, upon detection of a failure in a route of the 25 
network, retransmit data tagged as resent data along a 
different route of the network; 

a receiver to receive data transmitted from the transmitter 
and, upon detection of tagged data, utilize said tagged 
data for data communications, while discarding data ^® 
that are not tagged to avoid data duplication; and 

a switch fabric which comprises a plurality of different 
routes for connecting said transmitter and said receiver 
for data communications; 

wherein said data is transmitted in groups of cells with 
each cell having a header utilized for tagging by said 
transmitter upon detection of a failure in a route of the 
network; and 

wherein said transmitter automatically switches to a cell ^ 
tagging mode to tag unique information on said data in 
groups of cells individually and retransmit said tagged 
data as resent data along a different route of the switch 
fabric to said receiver during a cell tagging interval. 

2. A network as clainaed in claim 1, wherein said cell 
tagging interval is set to twice a maximum delay time that 
a cell exists in the switch fabric before being delivered to 
said receiver or discarded from the switch fabric. 

3. A network as claimed in claim 2, wherein said trans- 
mitter comprises a timer which terminates said cell tagging 
mode upon expiration of said cell tagging interval. 

4. A network, comprising: 

a transmitter to transmit data along a route of a network 
and, upon detection of a failure in a route of the 
network, retransmit data tagged as resent data along a 
different route of the network; 

a receiver to receive data transmitted from the transmitter 
and, upon detection of tagged data, utilize said tagged 
data for data communications, while discarding data 
that are not tagged to avoid data duplication; and 

a switch fabric which comprises a plurality of different 
routes for connecting said transmitter and said receiver 
for data commimications; 

wherein said data is transmitted in groups of cells with 
each cell having a header utilized for tagging by said 65 
transmitter upon detection of a failure in a route of the 
network; and 



35 



wherein said receiver automatically switches to a reject 
untagged cell mode, upon receipt of a first tagged cell 
of said resent data retransmitted from said transmitter, 
to discard all previously transmitted data that are not 
tagged by said transmitter to avoid data duplication for 
a duration that is one-half (V4) of said cell tagging 
interval. 

5. A network as claimed in claim 4, wherein said receiver 
comprises a timer which terminates said reject untagged cell 
mode upon expiration of the duration that is one-half (H) of 
said cell tagging interval. 

6. A network arrangement, comprising: 
a host system; 

at least one remote system; 

a switch fabric comprising a plurality of different switches 
which interconnect said host system to said remote 
system along different routes for data communications, 
and a central network manager connected to the dif- 
ferent switches for managing network management 
functions, including detecting and managing link fail- 
ures along different routes between said host system 
and said remote system; 

wherein said host system, upon detection of a link failure 
in a route to said remote system via said switch fabric, 
tags data previously transmitted to said remote system 
and transmits tagged data as resent data along ao 
alternate route to said remote system; 

wherein said remote system, upon detection of tagged 
data transmitted from said host system via said switch 
fabric, utilizes said tagged data for data 
communications, while discarding previously transmit- 
ted data that are not tagged to avoid data duplication; 

wherein said data is transmitted from said host system to 
said remote system in groups of cells with each cell 
having a header utilized for tagging by said host system 
upon detection of a link failure in the route to said 
remote system via said switch fabric; and 

wherein said host system automatically switches to a cell 
tagging mode to tag unique information onto said data 
in groups of cells individually and transmit said tagged 
data as resent data along an alternate route to said 
remote system during a cell tagging interval. 

7. A network arrangement, comprising: 
a host system; 

at least one remote system; 

a switch fabric comprising a plurality of different switches 
which interconnect said host system to said remote 
system along different routes for data communications, 
and a central network manager connected to the dff- 
ferent switches for managing network management 
functions, including detecting and managing link fail- 
ures along different routes between said host system 
and said remote system; 

wherein said host system, upon detection of a link failure 
in a route to said remote system via said switch fabric, 
tags data previously transmitted to said remote system 
and transmits tagged data as resent data along an 
alternate route to said remote system; 

wherein said remote system, upon detection of tagged 
data transmitted from said host system via said switch 
fabric, utilizes said tagged data for data 
communications, while discarding previously transmit- 
ted data that are not tagged to avoid data dupHcatioD; 

wherein said data is transmitted from said host system to 
said remote system in groups of ceUs with each cell 
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having a header utilized for lagging by said host system 
upon detection of a link failure in the route to said 
remote systeno via said switch fabric; and 
wherein said cell tagging interval is set to twice a maxi- 
mum delay time that a cell exists in said switch fabric ^ 
before being delivered to said remote system or dis- 
carded from said switch fabric. 

8. A network arrangement as claimed in claim 7, wherein 
said host system comprises a timer which terminates said 
cell tagging mode upon expiration of said cell tagging 
interval. 

9. A network arrangement, comprising: 
a host system; 

at least one remote system; 

a switch fabric comprising a plurality of different switches 
which interconnect said host system to said remote 
system along different routes for data cocamunications, 
and a central network manager connected to the dif- 
ferent switches for managing network management 20 
functions, including detecting and managing link fail- 
ures along different routes between said host system 
and said remote system; 

wherein said host system, upon detection of a Unk failure 
in a route to said remote system via said switch fabric, 25 
tags data previously transmitted to said remote system 
and transmits tagged data as resent data along an 
alternate route to said remote system; 

wherein said remote system, upon detection of tagged 
data transmitted from said host system via said switch 30 
fabric, utilizes said tagged data for data 
communications, while discarding previously transmit- 
ted data that are not tagged to avoid data duplication; 

wherein said data is transmitted from said host system to 
said remote system in groups of cells with each cell 35 
having a header utilized for tagging by said host system 
upon detection of a link failure in the route to said 
remote system via said switch fabric; and 

wherein said remote system automatically switches to a 
reject untagged cell mode, upon receipt of a first tagged 
cell of said resent data transmitted from said host 
system via said switch fabric, to discard all previously 
transmitted data that are not tagged by said host system 
to avoid data duplication for a duration that is one -half 
(V2) of said cell tagging interval. 

10. A network arrangement as claimed in claim 9, wherein 
said remote system comprises a timer which terminates said 
reject untagged cell mode upon expiration of the duration 
that is one-half of said cell tagging interval. 

U. A network arrangement as claimed in claim 6, wherein 
each switch of said switch fabric comprises: 

a plurality of ports, each port cormected to a link; 
a receive queue and a transmit queue coupled to each port; 
a forwarding database for storing cell routing information; 
a relay entity coupled to the queues and the forwarding 
database, for obtaining cell routing information &om 
the forwarding database and relaying the cell to the port 
specified by the cell routing information; and 
a management entity coupled to the relay entity, for 60 

routing the cell to said remote system, 
12. A method for detecting and discarding stale cells 
following route changes in a data communication network 
comprising at least a source node, a destination node, and a 
multi-stage switch comprised of a plurality of different 65 
routes connecting said source node to said destination node, 
said method comprising: 
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transmitting, at said source node, data in groups of cells 
to said destination node alone a route of said multi- 
stage switch; 

detecting a link failure along the route of said multi-stage 
switch between said source node and said destination 
node; 

upon detection, at said source node, of said link failure 
along the route of said multi-stage switch between said 
source node and said destination node, tagging unique 
tagging code onto a header of each cell of previously 
transmitted data for a cell tagging interval, and trans- 
mitting tagged cells as resent data to said destination 
node along an alternate route of said multi-stage 
switch; and 

upon detection, at said destination node, of a first of 
tagged cells transmitted as resent data from said source 
node via said multi-stage switch, discarding all 
untagged cells for a reject untagged cell interval and 
utilizing said tagged cells for data communications; 

wherein said cell tagging interval, corresponds to twice a 
maximum delay time that a cell exists in said multi- 
stage switch before being either delivered to said 
destination node or discarded from said multi-stage 
switch, and said reject untagged cell interval corre- 
sponds to one-half QA) of said cell tagging interval, 

13. A network comprising: 

a source node to transmit data; 

a destination node to receive and acknowledge data 
transmitted from the source node; and 

a switch fabric including a plurality of switches to estab- 
lish connection links between the source node and the 
destination along different routes for data communica- 
tions; 

wherein the source node is configured to detect a link 
failure along a route to the destination node, to tag data 
that was not acknowledged from the destination node 
during a data tagging mode, and to transmit tagged data 
along an alternate route to the destination node; 

wherein the destination node is configured to discard, 
upon receipt of the tagged data transmitted from the 
source node, data that are not tagged during a untagged 
data reject mode; and 

wherein the source node remains in the data tagging mode 
to tag data that was not acknowledged from the desti- 
nation node for a tagging period greater than twice a 
lifetime of the switch fabric indicating a maximum 
delay time that data packets exist in the switch fabric 
before being delivered to the destination node or dis- 
carded firom the switch fabric. 

14. A network as claimed in claim 13, wherein the 
destination node remains in the untagged data reject mode 
for one-half (14) of the tagging period used by the source 
node to tag data for transmission along the alternate route. 

15. A network as claimed in claim 13, wherein the data is 
tagged by prepending a special code to a header of each 
individual data packet during the data tagging mode prior to 
transmission to the destination node along the alternate 
route, 

16. A method for detecting and discarding stale data 
packets following route changes in a data communication 
network comprising at least a source node, a destination 
node, and a switch fabric comprised of a plurality of 
switches to establish connection links between the source 
node and the destination along different routes for data 
communications, said method comprising: 
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transmitting, at the source ncKle, data in groups of packets 
along a route to the destination node; 

detecting a link failure along the route between the source 
node and the destination node; 

upon detection, at the source node, of the link failure ^ 
along the route between the source node and the 
destination, tagging a unique code onto each data 
packet of previously transmitted data for a data tagging 
period, and transmitting tagged data packets to the 
destination node along an alternate route; and upon 
detection, at the destination node, of a first of tagged 
data packets transmitted from the source node, discard- 
ing all untagged data packets for an untagged data 
reject period and utilizing said tagged data packets for 
data communications; 
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wherein the data tagging period is greater than twice a 
lifetime of the switch fabric indicating a maximum 
delay time that data packets exist in the switch fabric 
before being delivered to the destination node or dis- 
carded from the switch fabric. 

17. A method as claimed in claim 16, wherein the 
untagged data reject period is one-half (Vi) the data tagging 
period used by the source node to lag data packets for 
transmission along the alternate route. 

18. A method as claimed in claim 16, wherein the source 
node contains a timer configured to terminate the tagging of 
individual data packets, upon expiration of the data tagging 
period. 

4^ 4c * 4 ♦ 
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